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Abstract: The variable \/s min was originally proposed in [1] as a model- independent, global 
and fully inclusive measure of the new physics mass scale in missing energy events at hadron 



■ colliders. In the original incarnation of y/s min , however, the connection to the new physics 

mass scale was blurred by the effects of the underlying event, most notably initial state 
radiation and multiple parton interactions. In this paper we advertize two improved variants 
of the y/s min variable, which overcome this problem. First we show that by evaluating the 
\/s min variable at the RECO level, in terms of the reconstructed objects in the event, the 
effects from the underlying event are significantly diminished and the nice correlation between 
the peak in the \/^min distribution and the new physics mass scale is restored. Secondly, 
the underlying event problem can be avoided altogether when the \fs min concept is applied 
to a subsystem of the event which does not involve any QCD jets. We supply an analytic 
formula for the resulting subsystem \fs^^ variable and show that its peak exhibits the 
usual correlation with the mass scale of the particles produced in the subsystem. Finally, 
we contrast y/s min to other popular inclusive variables such as Ht, MT ge n and MxTgen- We 
illustrate our discussion with several examples from super symmetry, and with dilepton events 
from top quark pair production. 
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1. Introduction and motivation 

1.1 The need for a universal, global and inclusive mass variable 

It is generally believed that missing energy signatures offer the best bet for discovering new 
physics Beyond the Standard Model (BSM) at colliders. This belief is reinforced by the dark 
matter puzzle - the Standard Model (SM) does not contain a suitable dark matter candidate. 
If dark matter particles are produced at colliders, they will be invisible in the detector, 
and will in principle lead to missing energy and missing momentum. However, at hadron 
colliders the total energy and longitudinal momentum of the event are unknown. Therefore, 
the production of any invisible particles can only be inferred from an imbalance in the total 
transverse momentum. The measured missing transverse momentum Jfx then gives the sum 
of the transverse momenta of all invisible particles in the event. 

Unfortunately, fx is the only measured quantity directly related to the invisible parti- 
cles. Without any further model-dependent assumptions, it is in general very difficult if not 
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impossible to make any definitive statements about the nature and properties of the miss- 
ing particles. For example, leaving all theoretical prejudice aside, one would not be able to 
answer such basic and fundamental questions like [1-5]: How many invisible particles were 
produced in the event? Are all invisible particles SM neutrinos, or are there any new neutral, 
stable, weakly-interacting massive particles (WIMPs) among them? What are the masses 
of the new invisible particles? What are their spins? What are the masses of any (parent) 
particles which may have decayed to invisible particles? 

The recent literature is abundant with numerous proposals 1 on how under particular cir- 
cumstances one might be able to measure the masses of the invisible particles. Unfortunately, 
all of the proposed methods suffer from varying degrees of model-dependence 2 : 

• Limited applicability topology-wise. Most methods are model-dependent in the sense 
that each method crucially relies on the assumption of a very specific event topology. 
One common flaw of all methods on the market is that they usually do not allow any 
SM neutrinos to enter the targeted event topology, and the missing energy is typically 
assumed to arise only as a result of the production of (two) new dark matter parti- 
cles. Furthermore, each method has its own limitations. For example, the traditional 
invariant mass endpoint methods [10-20] require the identification of a sufficiently long 
cascade decay chain, with at least three successive two-body decays [21]. The polyno- 
mial methods of Refs. [22-29] also require such long decay chains and furthermore, the 
events must be symmetric, i.e. must have two identical decay chains per event, or else 
the decay chain must be even longer [21]. The recently popular Mt2 methods [30-39] 
do not require long decay chains [21], but typically assume that the parent particles are 
the same and decay to two identical invisible particles 3 . The limitations of the Mqt 
methods [40-42] are rather similar. The kinematic cusp method [43] is limited to the 
so called "antler" event topology, which contains two symmetric one-step decay chains 
originating from a single s-channel resonance. In light of all these various assumptions, 
it is certainly desirable to have a universal method which can be applied to any event 
topology. To the best of our knowledge, the only such method in the literature is the 
one proposed in Ref. [1], where the y/s min variable was first introduced. The \/s min 
variable is defined in terms of the total energy E and 3-momentum P observed in the 
event, and thus does not make any reference to the actual event topology. In this sense 
y/s min is a universal variable which can be applied under any circumstances. 

• Limited applicability signature- wise. As a rule, most of the proposed methods work well 
only if the corresponding signature contains some minimum number of high px isolated 
leptons. Leptonic signatures have the twofold advantage of lower SM backgrounds 
and good lepton momentum measurement. The performance of the methods typically 
deteriorates as we lower the number of leptons in the signature. The most challenging 

1 See Ref. [6] for a recent review. 

2 Worse still, there are even fewer ideas for measuring the spins of the new particles in a truly model- 
independent fashion [7-9]. 

3 See [3,4] for a more general approach which avoids this assumption. 
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signature of multijets plus Et has rarely been studied in relation to mass and spin 
measurements (see, however [33,44-47]). Unfortunately, at hadron colliders like the 
Tevatron and LHC, one typically expects strong production to dominate the new physics 
cross-sections, and this in turn guarantees the presence of some minimum number of 
jets in the signature. At the same time, a priori there are no theoretical arguments 
which would similarly guarantee the presence of any hard isolated leptons. Therefore, 
one would like to have a general, sufficiently inclusive method, which treats jets and 
leptons on an equal footing. The \fs min method of Ref. [1] satisfies this requirement as 
well, since it does not differentiate between the type of reconstructed objects. In fact, 
the original proposal of Ref. [1] did not require any object reconstruction at all, and 
used (muon-corrected) calorimeter energy measurements to define the observed E and 
P in the event. 

• Combinatorics problem. Even if one correctly guesses the new physics event topology, 
and the signature happens to be abundant in hard isolated leptons, one still has to face 
the usual combinatorics problem of how to properly associate the reconstructed objects 
with the individual particles in the assumed event topology. Here we shall be careful 
to make the distinction between two different aspects of the combinatorics problem: 

— Partitioning ambiguity. As a prototypical example, consider a model of supersym- 
metry (SUSY) in which i?-parity is conserved and the lightest super symmetric 
particle (LSP) is neutral and stable. Each SUSY event contains two independent 
cascade decay chains, so first one must decide which reconstructed objects belong 
to the first decay chain and which belong to the second [32,48]. However, a pri- 
ori there are no guiding principles on how to do this partitioning into subsets. 
The decision is further complicated by the inevitable presence of jets from initial 
state radiation, which have nothing to do with the SUSY cascades [49]; by final 
state radiation, which modifies the assumed event topology; and by the occasional 
overlapping of jets [50]. 

— Ordering ambiguity. Having separated the objects into two groups, one must still 
decide on the sequential ordering of the reconstructed objects along each decay 
chain. One well-known example of this problem is the ambiguity between the 
"near" and "far" lepton in the standard jet-lepton-lepton squark decay chain [20]. 

The severity of either one of these two combinatorics problems depends on the type 
of signature — simple signatures resulting from short decay chains suffer from less 
combinatorics but tend to have larger SM backgrounds. By the same token, more 
complex signatures, which result from longer decay chains, are easier to see over the 
SM backgrounds, but very quickly run into severe combinatorial problems. Thus ideally 
one would like to have a method which treats all objects in the event in a fully inclusive 
manner, so that neither of these two combinatorial issues can ever arise at all. The 
\fs min variable of Ref. [1] was proposed for exactly this reason, and is free of the 
partitioning and ordering combinatorial ambiguities. 
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• Limited use of the available experimental information. At hadron colliders, events with 
invisible particles in the final state present an additional challenge: the total energy 
and longitudinal momentum of the initial state in the event are unknown. On the other 
hand, the transverse momentum of the initial state is known, which has greatly moti- 
vated the use of transverse variables like the missing "transverse energy" the scalar 
sum of transverse momenta H?, the transverse mass My, the stransverse mass M?2 [30], 
the contransverse mass Mqt [40], etc. An unsettling feature of a purely transverse kine- 
matical approach is that it completely ignores the measured longitudinal momentum 
components of the visible particles. In principle, the longitudinal momenta also carry a 
certain amount of information about the underlying physics, although it is difficult to 
see immediately how this information can be utilized. (For example, one cannot take 
advantage of longitudinal momentum conservation, because the longitudinal momen- 
tum of the initial state is unknown.) By defining the \fs min variable in a manifestly 
1+3 Lorentz invariant way, Ref. [1] proposed one possible way to utilize the additional 
information encoded in the measured longitudinal momenta. 

The above discussion makes it clear that the method of the y/s min variable has several 
unique advantages over all other known methods: it is completely general and universal, is 
fully inclusive, and to the fullest extent makes use of the available experimental information. 
In spite of these advantages, the ^fs min variable has not yet found wide application. The one 
major perceived drawback of \fs min is its sensitivity to initial state radiation (ISR) and/or 
multiple parton interactions (MPI) [1,6,51-53]. To see how this comes about, let us first 
review the formal definition of \/s min . 

1.2 Definition of Js min 

Consider the most generic missing energy event topology shown in Fig. |j. As seen from the 
figure, in defining \fs min , one imagines a completely general setup - each event contains some 
number n v i s of Standard Model (SM) particles Xi, z = 1,2,..., n v i s , which are visible in the 
detector, i.e. their energies and momenta are in principle measured. Examples of such visible 
SM particles are the basic reconstructed objects, e.g. jets, photons, electrons and muons. The 
visible particles Xi are denoted in Fig. [I] with solid black lines and may originate either from 
ISR, or from the hard scattering and subsequent cascade decays (indicated with the green- 
shaded ellipse). In turn, the missing transverse momentum fa arises from a certain number 
ni nv of stable neutral particles Xii * = 1,2, .. . ,nj n „, which are invisible in the detector. In 
general, the set of invisible particles consists of some number n x of BSM particles (indicated 
with the red dashed lines), as well as some number n v = ni nv — n x of SM neutrinos (denoted 
with the black dashed lines). As already mentioned earlier, the fa measurement alone does 
not reveal the number rij n „ of missing particles, nor how many of them are neutrinos and 
how many are BSM (dark matter) particles. This general setup also allows the identities and 
the masses of the BSM invisible particles Xi-> (i = 1> 2, . . . , n x ) in principle to be different, 
as in models with several different species of dark matter particles [54-57]. Of course, the 
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Figure 1: The generic event topology used to define the y/s min variable in Ref. [1]. Black (red) 
lines correspond to SM (BSM) particles. The solid lines denote SM particles Xi, i = 1,2, ... ,n V i s , 
which are visible in the detector, e.g. jets, electrons, muons and photons. The SM particles may 
originate either from initial state radiation (ISR) , or from the hard scattering and subsequent cascade 
decays (indicated with the green-shaded ellipse). The dashed lines denote neutral stable particles Xii 
i = i, 2, . . . , rii nv , which are invisible in the detector. In general, the set of invisible particles consists 
of some number n x of BSM particles (indicated with the red dashed lines), as well as some number 
n v = rii nv — n x of SM neutrinos (denoted with the black dashed lines). The identities and the masses 
rtii of the BSM invisible particles %i, (i = 1, 2, . . . , n x ) do not necessarily have to be all the same, i.e. we 
allow for the simultaneous production of several different species of dark matter particles. The global 
event variables describing the visible particles are: the total energy E, the transverse components P x 
and P y and the longitudinal component P z of the total visible momentum P. The only experimentally 
available information regarding the invisible particles is the missing transverse momentum Pr. 

neutrino masses can be safely taken to be zero 

rrii = 0, for i = n x + l,n x + 2, . . . ,m nv . (1.1) 

Given this very general setup, Ref. [1] asked the following question: What is the minimum 
value \fs min of the parton-level Mandelstam invariant mass variable y/s which is consistent 
with the observed visible 4-momentum vector P 11 = (E, P)? As it turned out, the answer to 
this question is given by the universal formula [1] 

V~s mm (W) = x/E 2 -P? + ^/W 2 +Fl , (1-2) 
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where the mass parameter $4 is nothing but the total mass of all invisible particles in the 
event: 

riinv n x 
i=l i=l 

and the second equality follows from the assumption of vanishing neutrino masses ( |1.1|) . The 
result ( |1.2[) can be equivalently rewritten in a more symmetric form 

yTsrninW) = \j M 2 + Pj< + \J W 2 + Ft M 

in terms of the total visible invariant mass M defined as 

M 2 = E 2 — P 2 - P 2 - P 2 = E 2 -P^- P 2 . (1.5) 

Notice that in spite of the complete arbitrariness of the invisible particle sector at this point, 
the definition of ^fs min depends on a single unknown parameter }A - the sum of all the 
masses of the invisible particles in the event. For future reference, one should keep in mind 
that transverse momentum conservation at this point implies that 

P T + Ft = 0. (1.6) 

The main result from Ref. [1] was that in the absence of ISR and MPI, the peak in the 
\fs min distribution nicely correlates with the mass threshold of the newly produced parti- 
cles. This observation provides one generic relation between the total mass of the produced 
particles and the total mass pi of the invisible particles. Based on several SUSY examples 
involving fully hadronic signatures in symmetric as well as asymmetric topologies, Ref. [1] 
showed that the accuracy of this measurement rivals the one achieved with the more tradi- 
tional Mt2 methods. 

1.3 y/s min and the underlying event problem 

At the same time, it was also recognized that effects from the underlying event (UE), most 
notably ISR and MPI, severely jeopardize this measurement. The problem is that in the 
presence of the UE, the y/s min variable would be measuring the total energy of the full 
system shown in Fig. |l], while for studying any new physics we are mostly interested in the 
energy of the hard scattering, as represented by the green-shaded ellipse in Fig. |l[ The 
inclusion of the UE causes a drastic shift of the peak of the \fs min distribution to higher 
values, often by as much as a few TeV [1,51,52]. As a result, it appeared that unless effects 
from the underlying event could somehow be compensated for, the proposed measurement of 
the \/s min peak would be of no practical value. 

The main purpose of this paper is to propose two fresh new approaches to dealing with 
the underlying event problem which has plagued the \fs min variable and prevented its more 
widespread use in hadron collider physics applications. But before we discuss the two new 
ideas put forth in this paper, we first briefly mention the two existing proposals in the 
literature on how to deal with the underlying event problem. 
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First, it was recognized in Ref. [1] that the contributions from the underlying event tend 
to be in the forward region, i.e. at large values of \rj\. Correspondingly, by choosing a suitable 
cut \rj\ < r] max , designed to eliminate contributions from the very forward regions, one could 
in principle restore the proper behavior of the y/s min distribution [1]. Unfortunately, there 
are no a priori guidelines on how to choose the appropriate value of rj max , therefore this 
approach introduces an uncontrollable systematic error and has not been pursued further in 
the literature. 

An alternative approach was proposed in Refs. [51,52], which pointed out that the ISR 
effects on \fs min are in principle calculable in QCD from first principles. The calculations 
presented in Refs. [51,52] could then be used to "unfold" the ISR effects and correct for the 
shift in the peak of the \/~s min distribution. Unfortunately, in this analytical approach, the 
MPI effects would still be unaccounted for, and would have to be modeled and validated 
separately by some other means. While such an approach may eventually bear fruit at some 
point in the future, we shall not pursue it here. 

We see that, for one reason or another, both of these strategies appear unsatisfactory. 
Therefore, here we shall pursue two different approaches. We shall propose two new variants 
of the \fs min variable, which we label \/^mira an< ^ V^min an d define in Sees. || and ||| 
correspondingly. We illustrate the properties of these two variables with several examples 
in Sees. SB. These examples will show that both v^mirT^ and y/sf^ are unharmed by the 



effects from the underlying event, thus resurrecting the original idea of Ref. [1] to use the peak 
in the \fs min distribution as a first, quick, model-independent estimate of the new physics 
mass scale. In Section [7| we compare the performance of y/s min against some other inclusive 
variables which are commonly used in hadron collider physics for the purpose of estimating 
the new physics mass scale. Section H is reserved for our main summary and conclusions. 



2. Definition of the RECO level variable yfs, 



(reco) 
min 



In the first approach, we shall not modify the original definition of \/s min and will continue to 
use the usual equation (1.2) (or its equivalent (1.4)), preserving the desired universal, global 



and inclusive character of the y/s min variable. Then we shall concentrate on the question, how 
should one calculate the observable quantities E, P and Pt entering the defining equations 
(O) and (O). 



The previous y/s min studies [1,51,52] used calorimeter-based measurements of the total 
visible energy E and momentum P as follows. The total visible energy in the calorimeter 
i?( ca ;) is simply a scalar sum over all calorimeter deposits 



where the index a labels the calorimeter towers, and E a is the energy deposit in the a tower. 
As usual, since muons do not deposit significantly in the calorimeters, the measured E a 
should first be corrected for the energy of any muons which might be present in the event 
and happen to pass through the corresponding tower a. The three components of the total 
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visible momentum P were also measured from the calorimeters as 

Px(cai) = ^2E a sm9 a cosip a , (2.2) 

a 

= E a s in °a sin ip a , (2.3) 

a 

Pz(cal) =J2 Ea C ° S 6a ' ( 2 - 4 ) 

a 

where 9 a and <p a are correspondingly the polar and azimuthal angular coordinates of the a 

calorimeter tower. The missing transverse momentum can similarly be measured from the 
calorimeter as (see eq. (|1.6|)) 

l?T{cal) = ~ PT(cal)- (2-5) 

Using these calorimeter-based measurements (2.1-2.5), one can make the identification 

(2.6) 
(2.7) 
(2.8) 

in the definition (|1.2|) and construct the corresponding "calorimeter-based" \fs min variable 
as 

This was precisely the quantity which was studied in [1,51,52] and shown to exhibit extreme 
sensitivity to the physics of the underlying event. 

Here we propose to evaluate the visible quantities E and P at the RECO level, i.e. 
in terms of the reconstructed objects, namely jets, muons, electrons and photons 4 . To be 
precise, let there be N Q bj reconstructed objects in the event, with energies E{ and 3-momenta 
Pi, i = 1,2, ... , N Q bj, correspondingly. Then in place of (2.6-Q|), let us instead identify 



E = 


E(cal) ) 


P = 


P(cal) > 


f T = 


Ft(coI) 



N, 



obj 



E = -E'(reco) = ^ Ei , (2.10) 



i=l 

N obj 



P = P ireco) = J2Pi, (2.11) 

1=1 

Ft = FT(reco) = -PT(reco) > ( 2 -12) 

and correspondingly define a "RECO-level" \fs min variable as 



VS ( Zn\W) - y/Ef reco) - P\ reco) + ^W 2 +F 2 T{Teco) , (2-13) 



4 This possibility was briefly alluded to in [1], but not pursued in any detail. 
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which can also be rewritten in analogy to (1.4) as 

- V M (-o) + P T(reco) + ^ + ^W) ' C 2 " 14 ) 

where ^Ttreco) and PTlreco) are related as in eq. ( 2.12| ) and the RECO-level total visible mass 
M( reco ) is defined by 

M (reco) = E (reco) ~ P (reco) ■ (2.15) 

What are the benefits from the new RECO-level y/s min definitions (p.!3| , |2.14| ) m compar- 



ison to the old calorimeter-based \fs min definition in (|2.9D? In order to understand the basic 
idea, it is worth comparing the calorimeter-based missing transverse momentum fa (which in 
the literature is commonly referred to as "missing transverse energy" $t) and the analogous 
RECO-level variable fly, the "missing fly". The fly vector is defined as the negative of the 
vector sum of the transverse momenta of all reconstructed objects in the event: 

N ob] 
i=l 

Then it is clear that in terms of our notation here, flr is nothing but fa( reco ). 

It is known that At performs better than $t [58]. First, flr is less affected by a number 
of adverse instrumental factors such as: electronic noise, faulty calorimeter cells, pile-up, etc. 
These effects tend to populate the calorimeter uniformly with unclustered energy, which will 
later fail the basic quality cuts during object reconstruction. In contrast, the true missing 
momentum is dominated by clustered energy, which will be successfully captured during 
reconstruction. Another advantage of fly is that one can easily apply the known jet energy 
corrections to account for the nonlinear detector response. For both of these reasons, CMS 
is now using fly at both the trigger level and offline [58]. 

Now realize that y/s^^ ^ s analogous to the calorimeter-based while our new variable 
V^min * s analogous to the RECO-level fly. Thus we may already expect that v^min wm 
inherit the advantages of fly and will be better suited for determining the new physics 
mass scale than the calorimeter-based quantity \fs^ n This expectation is confirmed in 
the explicit examples studied below in Sees. |4] and ||. Apart from the already mentioned 
instrumental issues, the most important advantage of V^min f rom the physics point of view 
is that it is much less sensitive to the effects from the underlying event, which had doomed 
its calorimeter-based V^min cousrn - 

Strictly speaking, the idea of does not solve the underlying event problem com- 

pletely and as a matter of principle. Every now and then the underlying event will still 
produce a well-defined jet, which will have to be included in the calculation of v^min • 
cause of this effect, we cannot any more guarantee that y's^™' provides a lower bound on 
the true value y/s true of the center-of-mass energy of the hard scattering — the additional 
jets formed out of ISR, pile-up, and so on, will sometimes cause yfs^f 1 *° excee d V^true- 
Nevertheless we find that this effect modifies only the shape of the v^min distribution, but 
leaves the location of its peak largely intact. To the extent that one is mostly interested in 
the peak location, v^min should already be good enough for all practical purposes. 
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Figure 2: A rearrangement of Fig. [I] into an event topology exhibiting a well defined subsystem 
(delineated by the black rectangle) with total invariant mass Vi (s " b) . There are n su b visible particles 
Xi, i = 1,2,..., n 3U t), originating from within the subsystem, while the remaining n V i S — n su t, visible 
particles X ntnb +i , . . . , X nvig are created upstream, outside the subsystem. The subsystem results 
from the production and decays of a certain number of parent particles Pj, j = 1, 2, . . . ,n p , (some 
of) which may decay semi-invisibly. All invisible particles xii ■ • ■ > Xn;„„ are then assumed to originate 
from within the subsystem. 

3. Definition of the subsystem variable v^mTn 

In this section we propose an alternative modification of the original \fs min variable, which 
solves the underlying event problem completely and as a matter of principle. The downside 
of this approach is that it is not as general and universal as the one discussed in the previous 
section, and can be applied only in cases where one can unambiguously identify a subsystem 
of the original event topology which is untouched by the underlying event. The basic idea 
is schematically illustrated in Fig. ||, which is nothing but a slight rearrangement of Fig. [l] 
exhibiting a well defined subsystem (delineated by the black rectangle). The original n v i s 
visible particle Xi from Fig. p] have now been divided into two groups as follows: 

1. There are n su b visible particles X\, . . . , X nsub originating from within the subsystem. 
Their total energy and total momentum are denoted by Er sub \ and P( su fy , correspond- 
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ingly. The subsystem particles are chosen so that to guarantee that they could not have 
come from the underlying event. 



2. The remaining n v i s — n sub visible particles X nsub+ i, . . . , X Uvig are created upstream 
(outside the subsystem) and have total energy Er up \ and total momentum P( up y The 
upstream particles may originate from the underlying event or from decays of heavier 
particles upstream - this distinction is inconsequential at this point. 



We also assume that all invisible particles xij • • • > Xn inv originate from within the subsystem, 
i.e. that no invisible particles are created upstream. In effect, all we have done in Fig. || is 
to partition the original measured values of the total visible energy E and 3-momentum P 
from Fig. [I] into two separate components as 

E = E( up ) + S( SU 6) , (3.1) 
P = P {up) + P {sub) . (3.2) 

Notice that now the missing transverse momentum is defined as 

Ft = -Pt(u P ) - PT(sub) > (3-3) 
while the total visible invariant mass M( sub ^ of the subsystem is given by 

M fsub) = E (sub) ~ P (sub) ■ ( 3 - 4 ) 

At this point the reader may be wondering what are the guiding principles for categorizing 
a given visible particle X; t as a subsystem or an upstream particle. Since our goal is to identify 
a subsystem which is shielded from the effects of the underlying event, the safest way to do 
the partition of the visible particles is to require that all QCD jets belong to the upstream 
particles, while the subsystem particles consist of objects which are unlikely to come from 
the underlying event, such as isolated electrons, photons and muons (and possibly identified 
r-jets and, to a lesser extent, tagged 6-jets). 

With those preliminaries, we are now ready to ask the usual \fs min question: Given 
the measured values of E( up y E( sub y P( U p) an d P(sub)> what is the minimum value \^s^^ 
of the subsystem Mandelstam invariant mass variable yj~s" su ^ , which is consistent with those 
measurements? Proceeding as in [1], once again we find a very simple universal answer, 
which, with the help of fl3.3|) and Q3.4|), can be equivalently written in several different ways 
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as follows: 




1 



7 



M fsu b ) + P T(su b) + V^ 2+ % ~ Pt(u p) } (3-6) 



( ^ M U) + P hsu b) + \IW 2 +?i) 2 " (PT ( su b) + Vt) 



(3.7) 



= \\PT(sub)+ Pt\ \ , (3.8) 
where in the last line we have introduced the Lorentz 1+2 vectors 



PT(sub) = U M (sub) + P T(sub) > P T( S nfe)j ! (3-9) 



j/ T = K/^f2 + ^2 fT \ _ (3 _ 10) 



As usual, the length of a 1+2 vector is computed as \\p\\ = y/p ■ p = \/p\ — p\ — p\- 

Before we proceed to the examples of the next few sections, as a sanity check of the 
obtained result it is useful to consider some limiting cases. First, by taking the upstream 
visible particles to be an empty set, i.e. Pt(up) ~^ 0, we recover the usual expression for 
\fs min given in eqs. (L^,L4). Next, consider a case with no invisible particles, i.e. $1 = 



and correspondingly, fir = 0. In that case we obtain that v^ri^n = -^(s«6)i which is of 
course the correct result. Finally, suppose that there are no visible subsystem particles, i.e. 
E(sub) = P(sub) = ^{sub) = 0. In that case we obtain \/^roin =ffl, which is also the correct 
answer. 

As we shall see, the subsystem concept of Fig. Q will be most useful when the subsystem 
results from the production and decays of a certain number n p of parent particles Pj with 
masses Mp., j = 1,2, ... ,n p , correspondingly. Then the total combined mass of all parent 
particles is given by 

np 

M P = Y,M P] . (3.11) 

i=i 

By the conjecture of ref. [1], the location of the peak of the y^m^n' \M) distribution will 
provide an approximate measurement of M p as a function of the unknown parameter fA. 
By construction, the obtained relationship MpQtt) will then be completely insensitive to the 
effects from the underlying event. 

At this point it may seem that by excluding all QCD jets from the subsystem, we have 
significantly narrowed down the number of potential applications of the \/s^^ variable. Fur- 
thermore, we have apparently reintroduced a certain amount of model-dependence which the 
original \fs min approach was trying so hard to avoid. Those are in principle valid objections, 
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which can be overcome by using the v^min variable introduced in the previous section. 
Nevertheless, we feel that the v^min variable can prove to be useful in its own right, and 
in a wide variety of contexts. To see this, note that the typical hadron collider signatures of 
the most popular new physics models (supersymmetry, extra dimensions, Little Higgs, etc.) 
are precisely of the form exhibited in Fig. One typically considers production of colored 
particles (squarks, gluinos, KK-quarks, etc.) whose cross-sections dominate. In turn, these 
colored particles shed their color charge by emitting jets and decaying to lighter, uncolored 
particles in an electroweak sector. The decays of the latter often involve electromagnetic ob- 
jects, which could be targeted for selection in the subsystem. The V^min variable would then 
be the perfect tool for studying the mass scales in the electroweak sector (in the context of 
supersymmetry, for example, the electroweak sector is composed of the charginos, neutralinos 
and sleptons). 

Before we move on to some specific examples illustrating these ideas, one last com- 
ment is in order. One may wonder whether the v^m^n variable should be computed at the 
RECO-level or from the calorimeter. Since the subsystem will usually be defined in terms 
of reconstructed objects, the more logical option is to calculate \fs^^ at the RECO-level 
and label it as v^mm' • However, to streamline our notation, in what follows we shall 
always omit the "reco" part of the superscript and will always implicitly assume that \fs^^ 
is computed at RECO-level. 



4. SM example: dilepton events from tt production 

In this and the next two sections we illustrate the properties of the new variables v^min^ 
and \/sl^ with some specific examples. In this section we discuss an example taken from 
the Standard Model, which is guaranteed to be available for early studies at the LHC. We 
consider dilepton events from tt pair production, where both Ws decay leptonically. In this 
event topology, there are two missing particles (two neutrinos). Therefore, these events very 
closely resemble the typical SUSY- like events, in which there are two missing dark matter 
particles. In the next two sections, we shall also consider some SUSY examples. In all cases, 
we perform detailed event simulation, including the effects from the underlying event and 
detector resolution. 



4.1 Event simulation details 

Events are generated with PYTHIA [59] (using its default model of the underlying event) at 
an LHC of 14 TeV, and then reconstructed with the PGS detector simulation package [60]. 
We have made certain modifications in the publicly available version of PGS to better match 
it to the CMS detector. For example, we take the hadronic calorimeter resolution to be [61] 
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while the electromagnetic calorimeter resolution is [61] 



tf\2 / S \ 2 {N 



■*> \VeJ + {e) +c2 - (4 ' 2 > 

where the energy E is measured in GeV, S = 3.63% is the stochastic term, N = 0.124 is the 
noise and C = 0.26% is the constant term. Muons are reconstructed within |?7| < 2.4, and we 
use the muon global reconstruction efficiency quoted in [61]. We use default px cuts on the 
reconstructed objects as follows: 3 GeV for muons, 10 GeV for electrons and photons, and 
15 GeV for jets. 

For the ti example presented in this section, we use the approximate next-to-next-to- 
leading order ti cross-section of a t i = 894 ± 4^4g^}2 pb at a top mass of m t = 175 GeV [62]. 
For the SUSY examples in the next two sections we use leading order cross-sections. 

Since our examples are meant for illustration purposes only, we do not include any 
backgrounds to the processes being considered, nor do we require any specific triggers. A 
detailed study of the dilepton ti signature including all those effects will appear elsewhere [63] . 

4 - 2 \^min 0) variable 

We first consider SUSY-like missing energy events arising from ti production, where each 
I'F-boson is forced to decay leptonically (to an electron or a muon). We do not impose any 
trigger or offline requirements, and simply plot directly the output from PGS 5 . We show 
various -y/s quantities of interest in Fig. ||, setting }A = 0, since in this case the missing 
particles are neutrinos and are massless. The dotted (yellow-shaded) histogram represents 
the true yfs distribution of the ti pair. It quickly rises at the ti mass threshold 

M p = 2m t = 350 GeV (4.3) 

and then eventually falls off at large ^fs due to the parton density function suppression. 
Because the top quarks are typically produced with some boost, the ~\/~Si rue distribution in 
Fig. H peaks a little bit above threshold: 

(V~s true ) peak >M p . (4.4) 

It is clear that if one could directly measure the \fs true distribution, or at least its onset, 
the ti mass scale will be easily revealed. Unfortunately, the escaping neutrinos make such a 
measurement impossible, unless one is willing to make additional model-dependent assump- 
tions 6 . 

5 Therefore, our plots in this subsection are normalized to a total number of events equal to a t j x BR(W — > 
e,/i) 2 . 

6 For example, one can use the known values of the neutrino, W and top masses to solve for the neutrino 
kinematics (up to discrete ambiguities). However, this method assumes that the full mass spectrum is already 
known, and furthermore, uses the knowledge of the top decay topology to perfectly solve the combinatorics 
problem discussed in the Introduction. As an example, consider a case where the lepton is produced first 
and the b-quark second, i.e. when the top first decays to a lepton and a leptoquark, which in turn decays to 
a neutrino and a b-quark. The kinematic method would then be using the wrong on-shell conditions. The 
advantage of the y/s min approach is that it is fully inclusive and does not make any reference to the actual 
decay topology. 
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Figure 3: Distributions of various \/s min quantities discussed in the text, for the dilepton tt sample 
at the LHC with 14 TeV CM energy and 0.5 fb _1 of data. The dotted (yellow-shaded) histogram gives 
the true y/s distribution of the tt pair. The blue histogram is the distribution of the calorimeter-based 
V^min var iable in the ideal case when all effects from the underlying event are turned off. The red 
histogram shows the corresponding result for v^min m the presence of the underlying event. The black 
histogram is the distribution of the v^min^ variable introduced in Sec. [| All \fs min distributions 
are shown for IjH = 0. 




Fig. [3| also shows two versions of the calorimeter-based variable: the blue (red) 

histogram is obtained by switching off (on) the underlying event (ISR and MPI). These 
curves reveal two very interesting phenomena. First, without the UE, the peak of the 
distribution (blue histogram) is very close to the parent mass threshold [1]: 

noUE (VS) »M P . (4.5) 

V / peak 

The main observation of Ref. [1] was that this correlation offers an alternative, fully inclusive 
and model- independent, method of estimating the mass scale M p of the parent particles, even 
when some of their decay products are invisible and not seen in the detector. 

Unfortunately, the "no UE" limit of eq. (|4.5|) is unphysical, and the corresponding 



distribution (blue histogram in in Fig. |3|) is unobservable. What is worse, when one tries to 
measure the \/s^^ distribution in the presence of the UE (red histogram in Fig. ^), the 
resulting peak is very far from the physical threshold: 

with UE (v^S) , » K . (4.6) 

V / peak 
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In the ti example of Fig. [|, the shift is on the order of 1 TeV! It appears therefore that in 
practice the v^mit P ea ^ would be uncorrelated with any physical mass scale, and instead 
would be completely determined by the (uninteresting) physics of the underlying event. Once 
the nice model-independent correlation of eq. Q4.5| ) is destroyed by the UE, it becomes of only 
academic value [1,6,51-53]. 

However, Fig. also suggests the solution to this difficult problem. If we look at the 
distribution of the y^Wn variable (black solid histogram) , we see that its peak has returned 
to the desired value: 

(V^rO «Af p , (4.7) 

V / peak 

thus resurrecting the original proposal of Ref. [1]. In order to measure physical mass thresh- 
olds, one simply needs to investigate the distribution of the inclusive v^min variable, which 
is calculated at RECO-level. Each peak in that distribution signals the opening of a new 
channel, and from ([4.7D the location of the peak provides an immediate estimate of the total 
mass of all particles involved in the production. Of course, the ^fs^in^ distribution is now 
not as sharply peaked as the unphysical "no UE" case of v^min > but as long as its peak is 
found in the right location, the method of Ref. [1] becomes viable once again. 

Our first main result is therefore nicely summarized in Fig. ^jj, which shows a total of 4 
distributions, 3 of which are either unphysical (the blue histogram of v^min * n absence of 
the UE), unobservable (the yellow-shaded histogram of y/s true ), or useless (the red histogram 
of "v/^min ^ n the presence of the UE). The only distribution in Fig. ||| which is physical, 
observable and useful at the same time, is the distribution of v^min ( son d black histogram). 

Before concluding this subsection, we explain the reason for the improved performance 
of the \/^mfra°^ variable in comparison to the y^fj version. As already anticipated in Sec. 
the basic idea is that energy deposits which are due to hard particles originating from the 
hard scattering, tend to be clustered, while the energy deposits due to the UE tend to be 
more uniformly spread throughout the detector. In order to see this pictorially, in Figs. |I] 
and [| we show a series of calorimeter maps of the combined ECAL+HCAL energy deposits 
as a function of the pseudorapidity rj and azimuthal angle (p. Since the calorimeter in PGS is 
segmented in cells of (At/, A<p) = (0.1, 0.1), each calorimeter tower is represented by a square 
pixel, which is color-coded according to the amount of energy present in the tower. We have 
chosen the color scheme so that larger deposits correspond to darker colors. 

Each calorimeter map figure below has four panels. In the upper two panels the calorime- 
ter is filled at the parton level directly from PYTHIA. This corresponds to a perfect detector, 
where we ignore any smearing effects due to the finite energy resolution. The lower two plots 
in each figure show the corresponding results after PGS simulation. Thus by comparing the 
plots in the upper row to those in the bottom row, one can see the effect of the detector 
resolution. While the finite detector resolution does play some role, we find that it is of no 
particular importance for understanding the reason behind the big swings in the \fs min peaks 
observed in Fig. |||. 

Let us instead concentrate on comparing the plots in the left column versus those in 
the right column. The left plots show the absolute energy deposit E a in the a calorimeter 
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Figure 4: PGS calorimeter map of the energy deposits, as a function of pseudorapidity rj and 
azimuthal angle <p, for a dilepton ti event with only two reconstructed jets. At the parton level, this 
particular event has two 6-quarks and two electrons. The location of a 6-quark (electron, muon) is 
marked with the letter "q" ("e", 'V). A grey circle delineates (the cone of) a reconstructed jet, 
while a green dotted circle denotes a reconstructed lepton. In the upper two plots the calorimeter is 
filled at the parton level directly from PYTHIA, while the lower two plots contain results after PGS 
simulation. The left plots show absolute energy deposits E a , while in the right plots the energy in 
each tower is shown projected on the transverse plane as E a cos9 a . 



tower, while in the right plots this energy is shown projected on the transverse plane as 
E a cos0 a . The difference between the left and the right plots is quite striking. The plots 
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Figure 5: The same as Fig. |[ but for an event with three additional reconstructed jets. 



on the left exhibit lots of energy, which is deposited mostly in the forward calorimeter cells 
(at large \r]\) [1]. The plots on the right, on the other hand, show only a few clusters of 
energy, concentrated mostly in the central part of the detector. Those energy clusters give 
rise to the objects (jets, electrons and photons) which are reconstructed from the calorimeter. 
Furthermore, each energy cluster can be easily identified with a parton-level particle in the top 
decay chain. In order to exhibit this correlation, in Figs. |4| and || we use the following notation 
for the parton-level particles: a 6-quark (electron, muon) is marked with the letter "q" ("e", 
"/i"). A grey circle delineates (the cone of) a reconstructed jet, while a green dotted circle 
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Event type 


PYTHIA parton level 


after PGS simulation 




r-(cal) 
V b rain 


r-(cal) 
V b min 


r-(reco) 
V b min 


it event in Fig. |] 


427 


1110 


1179 


363 


ti event in Fig. [| 


638 


2596 


2761 


736 


SUSY event in Fig. 12 


1954 


3539 


3509 


2085 



Table 1: Selected yfs quantities (in GeV) for the events shown in Figs. [|, |^ and |l2|. The second 
column shows the true invariant mass \fs true of the parent system: top quark pair in case of Figs. |] 
and ^ or gluino pair in case of Fig. [l^. The third column shows the value of the V^Viin variable 



9|) calculated at the parton level, without any PGS detector simulation, but with the full detector 
acceptance cut of \r)\ < 4.1. The fourth column lists the value of V^min obtained after PGS detector 



simulation, while the last column shows the value of the \fs ( ^f > variable defined in (2.13) 



marks a reconstructed lepton (electron or muon). The lepton isolation requirement implies 
that green circles should be void of large energy deposits off-center, and indeed we observe 
this to be the case. 

In particular, Fig. || shows a bare-bone dilepton ti event with just two reconstructed jets 
and two reconstructed leptons (which happen to be both electrons). As seen in the figure, 
the two jets can be easily traced back to the two 6-quarks at the parton level, and there are 
no additional reconstructed jets due to the UE activity. Because the event is so clean and 
simple, one might expect to obtain a reasonable value for \fs min , he. close to the ti threshold. 
However, this is not the case, if we use the calorimeter-based measurement V^min • As seen 
in Table [l], the measured value of v^Vnin ^ s ver Y f ar off — on the order of 1 TeV, even in the 
case of a perfect detector. The reason for this discrepancy is now easy to understand from 
Fig. ^. Recall that v^min ^ s defined in terms of the total energy Ev^n in the calorimeter, 
which in turn is dominated by the large deposits in the forward region, which came from the 
underlying event. More importantly, those contributions are more or less equally spread over 
the forward and backward region of the detector, leading to cancellations in the calculation 
of the corresponding longitudinal P z ( ca i) momentum component. As a result, the first term 
in (|2.9|) becomes completely dominated by the UE contributions [51]. 

Let us now see how the calculation of \fs^f l is affected by the UE. Since object recon- 
struction is done with the help of minimum transverse cuts (for clustering and object id), 
the relevant calorimeter plots are the maps on the right side in Fig. |j We see that the large 
forward energy deposits which were causing the large shift in y^mtn are n °t incorporated 
into any reconstructed objects, and thus do not contribute to the yfs^in^ calculation at all. 
In effect, the RECO-level prescription for calculating yfs min is leaving out precisely the un- 
wanted contributions from the UE, while keeping the relevant contributions from the hard 
scattering. As seen from Table [I], the calculated value of \fs^° for that event is 363 GeV, 
which is indeed very close to the ti threshold. It is also smaller than the true y/s value of 
427 GeV in that event, which is to be expected, since by design y/s min < and this event 
does not have any extra ISR jets to spoil this relation. 

It is instructive to consider another, more complex ti dilepton event, such as the one 
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shown in Fig. ||. The corresponding calculated values for v^min anc ^ V^min^ are shown m 
the second row of Table [l|. As seen in Fig. pi this event has additional jets and a lot more 
UE activity. As a result, the calculated value of \/*min * s shifted by almost 2 TeV from 
the nominal yfs true value. Nevertheless, the RECO-level prescription nicely compensates for 
this effect, and the calculated \/*min^ vame is onr y 736 GeV, which is within 100 GeV of 
the nominal \fs t = 638 GeV. Notice that in this example we end up with a situation 
where x/sj^ > V^true- Fig. H indicates that this happens quite often — the tail of the 
V^m^° distribution is more populated than the (yellow-shaded) \fs true distribution. This 
should be no cause for concern. First of all, we are only interested in the peak of the \/s^^ 
distribution, and we do not need to make any comparisons between \/s^„ an< ^ V$ true- 
Second, any such comparison would be meaningless, since the value of \fs true is a priori 
unknown, and unobservable. 

4 - 3 variable 

Before concluding this section, we shall use the tt example to also illustrate the idea of 
the subsystem \/s^^ variable developed in Sec. H Dilept on tt events are a perfect testing 
ground for this idea, since the WW subsystem decays leptonically, without any jet activity. 
We therefore define the subsystem as the two hard isolated leptons resulting from the decays of 
the TV-bosons. Correspondingly, we require two reconstructed leptons (electrons or muons) 
at the PGS level 7 , and plot the distribution of the leptonic subsystem v^mi? variable in 
Fig. ^. As before, the dotted (yellow-shaded) histogram represents the true y/s distribution 
of the W + W~ pair. As expected, it quickly rises at the WW threshold (denoted by the 
vertical arrow), then falls off at large \fs. Since the \fsp^Y distribution is unobservable, 
the best we can do is to study the corresponding \/*min distribution shown with the solid 
black histogram. In this subsystem example, all UE activity is lumped together with the 
upstream 6-jets from the top quarks decays, and thus has no bearing on the properties of the 
leptonic y/~s£^ • I n particular, we find that the value of y^^' is always smaller than the true 
\/strue W ^ ■ More importantly, Fig. || demonstrates that the peak in the \fs^^ distribution is 
found precisely at the mass threshold of the particles (in this case the two W bosons) which 
initiated the subsystem. Therefore, in analogy to ( [4.7] ) we can also write 

& (4.8) 

/ peak 

where Mp Sub ^ is the combined mass of all the parents initiating the subsystem. Fig. ^ shows 
that in the tt example just considered, this relation holds to a very high degree of accuracy. 

This example should not leave the reader with the impression that hadronic jets are never 
allowed to be part of the subsystem. On the contrary — the subsystem may very well include 
reconstructed jets as well. The ti case considered here in fact provides a perfect example to 
illustrate the idea. 



7 The selection efficiency for the two leptons is on the order of 60%, which explains the different normalization 
of the distributions in Figs. ^ and |(| 
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Figure 6: The same as Fig. Ej, but for the dilepton subsystem in dilepton tt events with two recon- 
structed leptons in PGS. The dotted (yellow-shaded) histogram gives the true yfs distribution of the 
W + W~ pair in those events. The black histogram shows the distribution of the (leptonic) subsystem 
variable v^m"^ defined in Sec. [| In this case, the subsystem is defined by the two isolated leptons, 
while all jets are treated as upstream particles. The vertical arrow marks the W + W~ mass threshold. 



Let us reconsider the tt dilepton sample, and redefine the subsystem so that we now tar- 
get the two top quarks as the parents initiating the subsystem. Correspondingly, in addition 
to the two leptons, let us allow the subsystem to include two jets, presumably coming from 
the two top quark decays. Unfortunately, in doing so, we must face a variant of the partition- 
ing 8 combinatorial problem discussed in the introduction: as seen in Fig. ||, the typical jet 
multiplicity in the events is relatively high, and we must therefore specify the exact procedure 
how to select the two jets which would enter the subsystem. We shall consider three different 
approaches. 

• B-tagging. We can use the fact that the jets from top quark decay are 6-jets, while the 
jets from ISR are typically light flavor jets. Therefore, by requiring exactly two b-tags, 
and including only the two ^-tagged jets as part of the subsystem, we can significantly 
increase the probability of selecting the correct jets. Of course, ISR will sometimes 
also contribute 6-tagged jets from gluon splitting, but that happens rather rarely and 
the corresponding contribution can be suppressed by a further invariant mass cut on 
the two 6-jets. The resulting V^min distribution for the subsystem of 2 leptons and 2 
6-tagged jets is shown in Fig. || with the black histogram. We see that, as expected, 
the distribution peaks at the tt threshold and this time provides a measurement of the 

8 By construction, the y/s min and v^m"n' variables never have to face the ordering combinatorial problem. 
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Figure 7: Unit-normalized distribution of jet multiplicity in dilepton tt events. 



top quark mass: 



(sub) 
min 



peak 



= 2m t = 350 GeV . 



(4.9) 



The disadvantage of this method is the loss in statistics: compare the normalization 
of the black histogram in Fig. || after applying the two 6-tags, to the dotted (yellow- 
shaded) distribution of the true ti distribution in the selected inclusive dilepton sample 
(without 6-tags). 

• Selection by jet pt- Here one can use the fact that the jets from top decays are on 
average harder than the jets from ISR. Correspondingly, by choosing the two highest 
Pt jets (regardless of 6-tagging), one also increases the probability to select the correct 
jet pair. The corresponding distribution is shown in Fig. ^ with the blue histogram, 
and is also seen to peak at the ti threshold. An important advantage of this method is 
that one does not have to pay the price of reduced statistics due to the two additional 
6-tags. 

• No selection. The most conservative approach would be to apply no selection criteria 
on the jets, and include all reconstructed jets in the subsystem. Then the subsystem 
V^mln variable essentially reverts back to the RECO-level inclusive variable y^min^ 
already discussed in the previous subsection. Not surprisingly, we find the peak of its 
distribution (red histogram in Fig. |8|) near the ti threshold as well. 

All three of these examples show that jets can also be usefully incorporated into the 
subsystem. The only question is whether one can find a reliable way of preferentially selecting 
jets which are more likely to originate from within the intended subsystem, as opposed to 
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Figure 8: The same as Fig. ||, but in addition to the two leptons, the subsystem now also includes: 
exactly two 6-tagged jets (black histogram); the two highest px jets (blue histogram); or all jets (red 
histogram). The dotted (yellow-shaded) histogram gives the true y/s distribution of the tt pair. 

from the outside. As we see in Fig. |1| in the tt case this is quite possible, although in general 
it may be difficult in other settings, like the SUSY examples discussed in the next section. 



5. An exclusive SUSY example: multijet events from gluino production 

Since \fs min is a fully inclusive variable, arguably its biggest advantage is that it can be 
applied to purely jetty events with large jet multiplicities, where no other method on the 
market would seem to work. In order to simulate such a challenging case, we consider gluino 
pair production in super symmetry, with each gluino forced to undergo a cascade decay chain 
involving only QCD jets and nothing else. For concreteness, we revisit the setup of Ref. [1], 
where two different possibilities for the gluino decays were considered: 

• In one scenario, the gluino g is forced to undergo a two-stage cascade decay to the LSP. 
In the first stage, the gluino decays to the second-lightest neutralino x\ an( ^ two Q uar k 
jets: g — > qqx®- I n turn, \\ itself is then forced to decay via a 3-body decay to 2 quark 
jets and the LSP: x% ~^ QQXv The resulting gluino signature is 4 jets plus missing 
energy: 

9-> nxl -> jnixi ■ (5-1) 

Therefore, gluino pair production will nominally result in 8 jet events. Of course, as 
shown in Fig. ||, the actual number of reconstructed jets in such events is even higher, 
due to the effects of ISR, FSR and/or string fragmentation. As seen from the figure, each 
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Figure 9: Unit-normalized distribution of jet multiplicity in gluino pair production events, with each 
gluino decaying to four jets and a Xi LSP as in (|5.l|) . 

such event has on average ~ 10 jets, presenting a formidable combinatorics problem. 
We suspect that all 9 mass reconstruction methods on the market are doomed if they 
were to face such a scenario. It is therefore of particular interest to see how well the 
y/s min method (which is advertized as universally applicable) would fare under such 
dire circumstances. 

• In the second scenario, the gluino decays directly to the LSP via a three-body decay 

9 -> ml , (5-2) 
so that gluino pair-production events would nominally have 4 jets and missing energy. 

For concreteness, in each scenario we fix the mass spectrum as was done in [1]: we use the 
approximate gaugino unification relations to relate the gaugino and neutralino masses as 

nig = 3m^o = 6m^o . (5.3) 

We can then vary one of these masses, and choose the other two in accord with these relations. 
Since we assume three-body decays in ( fTj ) and (|5.1|), we do not need to specify the SUSY 
scalar mass parameters, which can be taken to be very large. In addition, as implied by 
Q5.3| ), we imagine that the lightest two neutralinos are gaugino-like, so that we do not have 
to specify the higgsino mass parameter either, and it can be taken to be very large as well. 
After these preliminaries, our results for these two scenarios are shown in Figs. Ill] and 



11, correspondingly. In Fig. |l_0| (Fig. |ll| ) we consider the 8-jet signature arising from (|5^ 



9 With the possible exception of the Mrgen method of Ref. [32], see Section |^ below. 
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Figure 10: The same as Fig. ||, but for a SUSY example of gluino pair production, with each gluino 
decaying to four jets and a Xi LSP as indicated in fl5.l|). The mass spectrum is chosen as: (a) rrig = 600 
GeV, m^o = 200 GeV and m^o = 100 GeV; or (b) m s = 2400 GeV, m^, = 800 GeV and m^o = 400 
GeV. All three \/s min distributions are plotted for the correct value of the missing mass parameter, 
in this case IflL — 2m^o . 



a 

> 

O) 

o 

o 



X) 

\ 

Z 
T3 



800 



600 



400 



200 



, , , , 1 , , 


, , | , , , , | , , , , 






1 


10k events 


V3£? II' 1 ] 




no UE J ■ 1 ; 


m ? =600 GeV 




, Vst™ m X ;=100 GeV ; 




Xn": °min 


(a) J 1/ 





pj 600 
> 

0) 

O 
o 

>" 400 



^ 200 

\ 

Z 



1000 2000 

Vi" min (GeV) 



3000 



4000 



3000 



.'II 
_ (b) 


' ' ' ' 1 ' 1 ' ' 1 ' 1 ' 1 




)J=2mj; 




10k events 




\ ^true 


" viS? \ 


m g -=2400 GeV- 


no UE j 


mjo-400 GeV 


„W 1 ' J l . . . ■ 


klVtU Vs (cal) 



4000 



5000 6000 

Vimin (GeV) 



7000 



8000 



Figure 11: The same as Fig. [lO], but for the case of gluino decays to 2 jets and a Xi LSP as in (5.2) 



(the 4-jet signature arising from ( |5.2| )). In both figures, panels (a) correspond to a light mass 
spectrum m g = 600 GeV, m^o = 200 GeV and m^o = 100 GeV; while panels (b) correspond 
to a heavy mass spectrum nig = 2400 GeV, m^o = 800 GeV and m^o = 400 GeV. Each plot 
shows the same four distributions as in Fig. |3[ The \fs m i n distributions are all plotted for 
the correct value of the missing mass parameter, namely = 2m^o. 



Overall, the results seen in Figs. 10 and |ITj are not too different from what we already 
witnessed in Fig. ^ for the ti example. The (unobservable) distribution \fs true shown with 
the dotted yellow-shaded histogram has a sharp turn-on at the physical mass threshold M p = 
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Figure 12: The same as Fig. [I| but for a SUSY event of gluino pair production, with each gluino 
forced to decay to 4 jets and the LSP as in (5.1). The SUSY mass spectrum is as in Figs. |lO|(a) 
and [Uja): m g = 600 GeV, m^o = 200 GeV and m^o = 100 GeV. As in Figs. | and §, the circles 
denote jets reconstructed in PGS, and here "q" marks the location of a quark from a gluino decay 
chain. Therefore, a circle without a "q" inside corresponds to a jet resulting from ISR or FSR, while 
a letter "q" without an accompanying circle represents a quark in the gluino decay chain which was 
not subsequently reconstructed as a jet. 



2rrig. If the effects of the UE are ignored, the position of this threshold is given rather well 
by the peak of the \/s^ l J distribution (blue histogram). Unfortunately, the UE shifts the 
peak in y^min ^ ( rec ^ histogram). Fortunately, the distribution of the RECO-level 
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variable V^min (black histogram) is stable against UE contamination, and its peak is still 
in the right place (near M p ). 

Having already seen a similar behavior in the ti example of the previous section, these 
results may not seem very impressive, until one realizes just how complicated those events are. 
For illustration, Fig. ^ shows the previously discussed calorimeter maps for one particular 
"8 jet" event. This event happens to have 11 reconstructed jets, which is consistent with 
the typical jet multiplicity seen in Fig. ^. The values of the \/s quantities of interest for 
this event are listed in Table ||. We see that the RECO prescription for calculating \fs min is 
able to compensate for a shift in y/s of more than 1.5 TeV! A casual look at Fig. 12 should 
be enough to convince the reader just how daunting the task of mass reconstruction in such 
events is. In this sense, the ease with which the \fs min method reveals the gluino mass scale 
in Figs. IT?] and 11 is quite impressive. 



6. An inclusive SUSY example: GMSB study point GMlb 

In the Introduction we already mentioned that \fs min is a fully inclusive variable. Here we 
would like to point out that there are two different aspects of the inclusivity property of 

min' 

• Object-wise inclusivity: \/s min is inclusive with regards to the type of reconstructed 
objects. The definition of V^min does not distinguish between the different types of 
reconstructed objects (and makes no reference to any reconstructed objects at 
all). This makes \/s min a very convenient variable to use in those cases where the 
newly produced particles have many possible decay modes, and restricting oneself to a 
single exclusive signature would cause loss in statistics. For illustration, consider the 
gluino pair production example from the previous section. Even though we are always 
producing the same type of parent particles (two gluinos), in general they can have 
several different decay modes, leading to a very diverse sample of events with varying 
number of jets and leptons. Nevertheless, the v^min distribution, plotted over this 
whole signal sample, will still be able to pinpoint the gluino mass scale, as explained in 
Sec. §. 

• Event-wise inclusivity: \fs min is inclusive also with regards to the type of events, i.e. the 
type of new particle production. For simplicity, in our previous examples we have been 
considering only one production mechanism at a time, but this is not really necessary — 
\/s m i n can also be applied in the case of several simultaneous production mechanisms. 

In order to illustrate the last point, in this section we shall consider the simultaneous 
production of the full spectrum of SUSY particles at a particular benchmark point. We chose 
the GMlb CMS study point [64], which is nothing but a minimal gauge-mediated SUSY- 
breaking (GMSB) scenario on the SPS8 Snowmass slope [65]. The input parameters are 
A=80 TeV, M mes =160 TeV, N mes =l, tan/3 = 15 and \i > 0. The physical mass spectrum is 
given in Table |2[ Point GMlb is characterized by a neutralino NLSP, which promptly decays 
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Table 2: Masses (in GeV) of the SUSY particles at the GMlb study point. Here u and d (l and vg) 
stand for either of the first two generations squarks (sleptons) . 
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Figure 13: Distribution of the v^min (dotted red) and v^min^ (solid black) variables in inclusive 
SUSY production for the GMSB GMla benchmark study point with parameters A = 80 TeV, M mes 
1 fit) TeV, N mes = 1, tan/3 = 15 and [i > 0. The dotted yellow-shaded histogram is the true y/s 
distribution of the parent pair of SUSY particles produced at the top of each decay chain (the identity 
of the parent particles varies from event to event). The \fs min distributions are shown for ])H = and 
are normalized to 1 fb _1 of data. The vertical arrows mark the mass thresholds for a few dominant 
SUSY pair-production processes. 



(predominantly) to a photon and a gravitino. Therefore, a typical event has two hard photons 
and missing energy, which provide good handles for suppressing the SM backgrounds. 

We now consider inclusive production of all SUSY subprocesses and plot the y/s min 
distributions of interest in Fig. As usual, the dotted yellow-shaded histogram is the true 
yfs distribution of the parent pair of SUSY particles produced at the top of each decay chain. 
Since we do not fix the production subprocess, the identity of the parent particles varies from 
event to event. Naturally, the most common parent particles are the ones with the highest 
production cross-sections. For point GMlb, at a 14 TeV LHC, strong SUSY production 
dominates, and is 87% of the total cross-section. A few of the dominant subprocesses and 
their cross-sections are listed in Table |3| 
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Table 3: Cross-sections (in pb) and parent mass thresholds (in GeV) for the dominant production 
processes at the GMlb study point. The listed squark cross-sections are summed over the light squark 
flavors and conjugate states. The total SUSY cross-section at point GMlb is 9.4 pb. 



The true yfs distribution in Fig. ^ exhibits an interesting double-peak structure, which 
is easy to understand as follows. As we have seen in the exclusive examples from Sees. |] and |5[ 
at hadron colliders the particles tend to be produced with yfs close to their mass threshold. As 
seen in Table g, the particle spectrum of the GMlb point can be broadly divided (according 
to mass) into two groups of superpartners: electroweak sector (the lightest chargino Xi > 
second-to-lightest neutralino x°> an< ^ sleptons) with a mass scale on the order of 200 GeV and 
a strong sector (squarks and gluino) with masses of order 700 — 900 GeV. The first peak in the 
true yfs distribution (near yfs ~ 500 GeV) arises from the pair production of two particles 
from the electroweak sector, while the second, broader peak in the range of yfs ~ 1500 — 2300 
GeV is due to the pair production of two colored superpartners 10 . Each one of those peaks 
is made up of several contributions from different individual subprocesses, but because their 
mass thresholds 11 are so close, in the figure they cannot be individually resolved, and appear 
as a single bump. 

If one could somehow directly observe the true yfs SUSY distribution (the dotted yellow- 
shaded histogram in Fig. |l3|), this would lead to some very interesting conclusions. First, from 
the presence of two separate peaks one would know immediately that there are two widely 
separated scales in the problem. Second, the normalization of each peak would indicate 
the relative size of the total inclusive cross-sections (in this example, of the particles in the 
electroweak sector versus those in the strong sector). Finally, the broadness of each peak 
is indicative of the total number of contributing subprocesses, as well as the typical mass 
splittings of the particles within each sector. It may appear surprising that one is able to 
draw so many conclusions from a single distribution of an inclusive variable, but this just 
comes to show the importance of yfs as one of the fundamental collider physics variables. 

Unfortunately, because of the missing energy due to the escaping invisible particles, the 
true yfs distribution cannot be observed, and the best one can do to approximate it is to look 
at the distributions of our inclusive yfs min variables discussed in Sec. |2[ the calorimeter-based 
V^min variable (dotted red histogram in Fig. [[D and the RECO-level y/s { ™™ ] variable (solid 
black histogram in Fig. |13|). In the figure, both of those are plotted for Ifl = 0. 

First let us concentrate on the calorimeter-based version \fs^2 (dotted red histogram) . 
We can immediately see the detrimental effects of the UE: first, the electroweak production 
peak has been almost completely smeared out, while the strong production peak has been 



10 The attentive reader may also notice two barely visible bumps (near 950 GeV and 1150 GeV) reflecting 
the associated production of one colored and one uncolored particle: gxt<9X2 and qxt,<lX2> correspondingly. 
11 A few individual mass thresholds are indicated by vertical arrows in Fig. n3l 
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Figure 14: The same as Fig. ^, but for the GMSB SUSY example considered in Fig. [l3| Here the 
subsystem is defined in terms of the two hard photons resulting from the two Xi — > G + 7 decays. 
The vertical arrow marks the onset for inclusive X\Xi production. 



shifted upwards by more than a TeV! This behavior is not too surprising, since the same 
effect was already encountered in our previous examples in Sees. |] and [|. Fortunately, we 
now also know the solution to this problem: one needs to consider the RECO-level variable 
V^min^ instead, which tracks the true y/s distribution much better. We can see evidence of 
this in Fig. [L3| as well: notice how the (black) \/s„rf„ histogram exhibits the same features 
as the (yellow-shaded) true y/s distribution. In particular, y/s^f^ does show two separate 
peaks (indicating that SUSY production takes place at two different mass scales), the peaks 
are in their proper locations (relative to the missing mass scale $f), and have the correct 
relative width, hinting at the size of the mass splittings in each sector. We thus conclude that 
all of the interesting physics conclusions that one would be able to reach from looking at the 
true y/s distributions, can still be made based on the inclusive distribution of our RECO-level 
V / Smin° ) variable. 

Before concluding this section, we shall take the opportunity to use the GMlb example 
to also illustrate the y/s^^ variable proposed in Sec. ||. As already mentioned, the GMlb 
study point corresponds to a GMSB scenario with a promptly decaying Bino-like Xi NLSP. 
Most events therefore contain two hard photons from the two Xi decays to gravitinos. Then 
it is quite natural to define the exclusive subsystem in Fig. || in terms of these two photons. 
The corresponding y/s^f^ distribution is shown in Fig. 14 with the black solid histogram. 
For completeness, in the figure we also show the true y/s distribution of the Xi P arr (dotted 
yellow-shaded histogram). The vertical arrow marks the location of the X1X1 mass threshold. 
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We notice that the peak of the V^miri^ distribution nicely reveals the location of the neutralino 
mass threshold, and from there the neutralino mass itself. We see that the method of \fs£^ 
provides a very simple way of measuring the NLSP mass in such GMSB scenarios (for an 
alternative approach based on Mt2, see [66]). 

7. Comparison to other inclusive collider variables 

Having discussed the newly proposed variables v^min^ anc ^ V^min m var i° us settings in 
Sees. ||-[| we shall now compare them to some other global inclusive variables which have 
been discussed in the literature in relation to determining a mass scale of the new physics. For 
simplicity here we shall concentrate only on the most model-independent variables, which do 
not suffer from the topological and combinatorial ambiguities mentioned in the Introduction. 

At the moment, there are only a handful of such variables. Depending on the treatment 
of the unknown masses of the invisible particles, they can be classified into one of the following 
two categories: 

• Variables which do not depend on an unknown invisible mass parameter. The most 
popular members of this class are the "missing Ht" variable 



T 



N obj 



(7.1) 



which is simply the magnitude of the JIt vector from eq. ( |2.16 ), and the scalar Ht 
variable 

N ob:j 

H T = H T +Y, Pn • (7.2) 

i=l 

Here we follow the notation from Sec. ||, where Pxi is the measured transverse mo- 
mentum of the i-th reconstructed object in the event (i = 1,2, ... , Nobj). The main 
advantage of Ht and Ht is their simplicity: both are very general, and are defined 
purely in terms of observed quantities, without any unknown mass parameters. The 
downside of Ht and Ht is that they cannot be directly correlated with any physical 
mass scale in a model-independent way 12 . 

Variables which exhibit dependence on one or more invisible mass parameters. As two 
representatives from this class we shall consider MT ge n from Ref. [32] and v^min^ f rom 
Sec. H here. We shall not repeat the technical definition of Mr 9f ,„, and instead refer the 
uninitiated reader to the original paper [32]. Suffice it to say that the method of MTgen 
starts out by assuming exactly two decay chains in each event. The arising combinato- 
rial problem is then solved by brute force — by considering all possible partitions of the 



12 Some early studies of ifr-like variables found interesting linear correlations between the peak in the Ht 
distribution and a suitably defined SUSY mass scale in the context of specific SUSY models, e.g. minimal 
supergravity (MSUGRA) [10,67,68], minimal GMSB [67], or mixed moduli- mediation [69]. However, any such 
correlations do not survive further scrutiny in more generic SUSY scenarios, see e.g. [70]. 
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event into two sides, computing Mp2 for each such partition, and taking the minimum 
value of Mt2 found in the process. Both Mp gen and y's^™' introduce a priori unknown 
parameters related to the mass scale of the missing particles produced in the event. In 
the case of \fs^^\ * n ^ s ^ s simply the single parameter ljf/[ , measuring the total invisible 



mass (in the sense of a scalar sum as defined in eq. The Mp gen variable, on the 

other hand, must in principle introduce two separate missing mass parameters and 
^2 (one for each side of the event). However, the existing applications of Mp gen m 
the literature have typically made the assumption that =^2, although this is not 
really necessary and one could just as easily work in terms of two separate inputs 
and 1^2 [3,4]. The inconvenience of having to deal with unknown mass parameters in 
the case of iWr 9en and \fs^^ is greatly compensated by the luxury of being able to 
relate certain features of their distributions to a fundamental physical mass scale in a 
robust, model-independent way. In particular, the upper endpoint of the Mp gen 

distribution gives the larger of the two parent masses max{ Mp x , MpJ [71]. Therefore, 
if the two parent masses are the same, i.e. Mp 1 = Mp 2 , then the parent mass threshold 
M p = Mp 1 + Mp 2 is simply given by 

M p = 2M^\ (7.3) 

On the other hand, as we have already seen in Sees. |]-||, the peak of the \/s^i*„ ^ s 
similarly correlated with the parent mass threshold, see eq. d4.7[) . 

In principle, all four 13 of these variables are inclusive both object-wise and event-wise. It is 
therefore of interest to compare them with respect to: 

1. The degree of correlation with the new physics mass scale M p . 

2. Stability of this correlation against the detrimental effects of the UE. 



Figs. |15|, |16| and |17J allow for such comparisons. 

In Fig. |l5]we first revisit the case of the dilepton ti sample discussed in Sec. |j. In addition 
to the true yfs (yellow shaded) and v^miri (black) distribution already appearing in Fig. ^ 
we now also plot the distributions of 2Mp gen (red dots), Hp (green dots) and ftp (blue dots), 
all calculated at the RECO-level. For completeness, in Fig. |l5| we also show a variant of 
Mpg en , called Mpp gen (magenta dots), where all visible particle momenta are first projected 
on the transverse plane, before computing Mp gen in the usual way [32] 14 . All results include 
the full simulation of the underlying event. For plotting convenience, the ftp distribution is 
shown scaled down by a factor of 2. 



13 We caution the reader that Ht is often defined in a more narrow sense than eq. ((f^). For example, 
sometimes the fir term is omitted, sometimes the sum in eq. (7.2) is limited to the reconstructed jets only; 



or to the four highest pr jets only; or to all jets, but starting from the second-highest px one. 

14 We caution the reader that the definition of Mrrgen cannot be found in the published version of Ref. [32] 
— the MrTgen discussion was added in a recent replacement on the archive, which appeared more than two 
years after the original publication. 
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Figure 15: The same as Fig. 0, but now in addition to the true yfs (yellow shaded) and v^roin 
(black) distribution, we also plot the distributions of 2M Tgen (red dots), 2M TTgen (magenta dots), 
Ht (green dots) and fix (blue dots), all calculated at the RECO-level. All results include the full 
simulation of the underlying event. For plotting convenience, the fix distribution is shown scaled 
down by a factor of 2. The vertical dotted line marks the tt mass threshold M p — 2m t = 350 GeV. 



Based on the results from Fig. 15, we can now address the question, which inclusive 
distribution shows the best correlation with the parent mass scale (in this case the parent 
mass scale is the tt mass threshold M p = 2mt = 350 GeV marked by the vertical dotted 
line in Fig. |l5|). Let us begin with the two variables, Ht an d Ht, which do not depend 



on any unknown mass parameters. Fig. 15 reveals that the Ht distribution peaks very far 
from threshold, and therefore does not reveal much information about the new physics mass 
scale. Consequently, any attempt at extracting new physics parameters out of the missing 
energy distribution alone, must make some additional model-dependent assumptions [72]. On 
the other hand, the Ht distribution appears to correlate better with M p , since its peak is 
relatively close to the tt threshold. However, this relationship is purely empirical, and it is 
difficult to know what is the associated systematic error. 

Moving on to the variables which carry a dependence on a missing mass parameter, 
V^mirT ' ^^Tgen and 2MTTgen, we see that all three are affected to some extent by the 
presence of the UE. In particular, the distributions of 2Mr 9en and 2MTTgen are now smeared 
and extend significantly beyond their expected endpoint (|7.3[). Not surprisingly, the UE has 
a larger impact on 2MT ge n than on 2MTTgen- In either case, there is no obvious endpoint. 
Nevertheless, one could in principle try to extract an endpoint through a straight-line fit, for 
example, but it is clear that the obtained value will be wrong by a certain amount (depending 
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Figure 16: The same as Fig. but for the gluino pair production example from Sec. |[ with each 

We use the light SUSY mass spectrum from Fig. |lO](a). The 

1200 GeV. 



gluino decaying to 4 jets as in (5.1) 
vertical dotted line now shows the gg mass threshold M p 



on the chosen region for fitting and on the associated backgrounds). All these difficulties with 
2MT g en and 2MxTgen are simply a reflection of the challenge of measuring a mass scale from 
an endpoint as in ( |7.3| ), instead of from a peak as in fl4.7p . By comparison, the determination 



(reco) 



distribution is much more robust. As shown 



of the new physics mass scale from the \fs, 
in Fig. 15, the s/s^f 1 P ea k is barely affected by the UE, and is still found precisely in the 
right location. 

All of the above discussion can be directly applied to the SUSY examples considered in 
Sec. U as well. As an illustration, Figs. |16] and [17] revisit two of the gluino examples from 
Section ^. In both figures, we consider gluino pair-production with a light SUSY spectrum 
(m^o = 100 GeV, m^o = 200 GeV and rrig = 600 GeV). Then in Fig. 16 each gluino decays 
to 4 jets as in eq. (5.1), while in Fig. [l7] each gluino decays to 2 jets as in eq. (|5.2|) . (Thus 
Fig. [u] is the analogue of Fig. |lO|(a), while Fig. 17 is the analogue of Fig. pT|(a).) 

The conclusions from Figs. |l6| and 17 are very similar. Both figures confirm that flx 
is not very helpful in determining the gluino mass scale M p = 2rrig = 1200 GeV (indicated 
by the vertical dotted line). The Ht distribution, on the other hand, has a nice well-defined 
peak, but the location of the Ht peak always underestimates the gluino mass scale (by about 
250 GeV in each case). Figs. |1] and 17 also confirm the effect already seen in Fig. [l5|: that the 
underlying event causes the 2My 9en and 2MTTgen distributions to extend well beyond their 
upper kinematic endpoint, thus violating ( |7.3[ ) and making the corresponding extraction of 
M„ rather problematic. In fact, just by looking at Figs. 16 and Il7], one might be tempted to 
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Figure 17: The same as Fig. |l(| but with each gluino decaying to 2 jets as in (5.2). Compare to 
Fig. 0(a). 



deduce that, if anything, it is the peak in 2MT ge n that perhaps might indicate the value of the 
new physics mass scale and not the 2Mr 9en endpoint. Finally, the yfs min distribution also 
feels to some extent the effects from the UE, but always has its peak in the near vicinity of 
M p . Therefore, among the five inclusive variables under consideration here, y/s, 
to provide the best estimate of the new physics mass scale. The correlation (^? 
in this paper is seen to hold very well in Fig. 17 and reasonably well in Fig. WB. 
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8. Summary and conclusions 

Since the original proposal of the \fs min variable in Ref. [1], its practicability has been called 
into question in light of the effects from the underlying event, in particular initial state 
radiation and multiple parton interactions. In this paper we proposed two variations of the 
y/~s min variable which are intended to avoid this problems. 

1. RECO-level ^/sl™ . The first variant, the RECO-level variable y/s^f 1 introduced 
in Sec. |2[ is basically a modification of the prescription for computing the original 
\fs min variable: instead of using (muon-corrected) calorimeter deposits, as was done 
in [1,51], one could instead calculate \fs min with the help of the reconstructed objects 
(jets and isolated photons, electrons and muons). Our examples in Sections [|, [B| and 
^ showed that this procedure tends to automatically subtract out the bulk of the UE 
contributions, rendering the v^min^ variable safe. 



2. Subsystem \/s^^ . Our second suggestion, discussed in Sec. [3|, was to apply \/s min to 
a subsystem of the observed event, which is suitably defined so that it does not include 
the contributions from the underlying event. The easiest way to do this is to veto jets 
from entering the definition of the subsystem. In this case, the subsystem variable 
V^min * s completely unaffected by the underlying event. However, depending on the 
particular scenario, in principle one could also allow (certain kinds of) jets to enter the 
subsystem. As long as there is an efficient method (through cuts) of selecting jets which 
(most likely) did not originate from the UE, this should work as well, as demonstrated 
in Fig. H with our ti example. 

Being simply variants of the original \/s min variable, both y/s^^ an d yf^m^n au t°- 
matically inherit the many nice properties of y/s min : 

• Both yfs^^f 1 an d ^fslrdn have a clear physical meaning: the minimum CM energy 
in the (sub)system, which is required in order to explain the observed signal in the 
detector. 

• Both \/~Smin°^ an d \fSmin are defined in a manifestly 1+3 Lorentz invariant way. As 
a consequence, their definitions utilize the available information about the longitudinal 
momentum components of the particles observed in the detector. 

• Both v^mirf^ an d V^min^ can ^ e computed by simple analytical formulas, eqs. ( 2.13 , 2.14 ) 
and ( |3.5[]3.8| ), correspondingly. 

• \/s^^° ( an d to some extent y/s^n) is a general, global, and inclusive variable, which 
can be applied to any type of events, regardless of the event topology, number or type 
of reconstructed objects, number or type of missing particles, etc. For example, all of 
the arbitrariness associated with the number and type of missing particles is encoded 
by a single parameter ])H. 

• The most important property of both ^™ ) ' and v^mirf * s * na ^ they exhibit a peak 
in their distributions, which directly correlates with the mass scale M p of the parent 
particles. In this regard we remind the reader that, compared to a kinematic endpoint, 
a peak is a feature which is much easier to observe and subsequently measure precisely 
over the SM backgrounds. This point was specifically illustrated in Sec. |7|, where we 
contrasted the observability of the peak in the v^min distribution to the observability 
of the endpoints of the 2Mr gen and 2MxTgen distributions. 

At the same time, compared to the original calorimeter-based y/s min variable considered 
in Ref. [1], the new variables y/s^^° an d yf^lrdn proposed here have one crucial advantage: 
they have very little sensitivity to the effects from the underlying event (ISR and MPI). As 
a result, the measurement of the corresponding mass scale from the peak in the distribution 
°f V^min^ or \^min * s r °bust and physically meaningful. 

In conclusion, we have shown that the variables y/s^^ and yfs^n nave certain im- 
portant advantages, and we feel that the experimental collaborations at the Tevatron and 
the LHC can only benefit from including them among their arsenal of observables. 
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